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LISTING OF THE CLAIMS 



(including amendments, if any) 



1. (currently amended) A method for representing statistics about a table including one or 
more rows, each row including a respective value, the method including: 

creating zero one or more histogram buckets, each histogram bucket including a width 
representing a respective range of values and a height representing a count of 
rows in the table having values in the range of values; and 

creating one or more high-bias buckets, each high-bias bucket representing including 
one or more high-bias values up to a maximum number of high-bias values (F) 
that appear in a minimum percentage of rows in the table and for each high-bias 
value a number of rows that contain the high-bias value ; 

repeating the following: 

(a) determining an average height of the histogram buckets; 

(b) determining a reclassification threshold based on the average height of 

the histogram buckets; and 

(c) concluding that a value associated with one of the one or more histogram 

buckets occurs in more rows of the table than the reclassification 
threshold, and, in response, concluding that the number of high-bias 
values associated with at least one of the one or more high-bias 
buckets has not reached the maximum number of high-bias values, 
and, in response, including the value in one of the high-bias buckets 
for which the number of high-bias values has not reached the 
maximum number of high-bias values; 
until no values included in any of the ranges of values associated with the histogram 

buckets occur in more than the reclassification threshold number of rows in 

the table; and 

performing query optimization based, at least in part, on one or more of the zero 
saving in a memory the width and the height of each of the one or more 
histogram buckets and the one or more high-bias values and numbers of rows 
for each of the one or more high-bias buckets. 
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2. (currently amended) The method of claim 1, where a total number of buckets is a 
fixed number equal to the sum of the number of histogram buckets and the number of high-bias 
buckets is less than a predetermined number . 

3. (cancelled) The method of claim 1, where creating the high-bias and histogram buckets 
includes: 

(a) determining an average height of the histogram buckets; 

(b) based on the average height of the histogram buckets, determining a reclassification 

threshold; and 

(c) representing each value that exceeds the reclassification threshold in a high-bias 

bucket. 

4. (currently amended) The method of claim [[3]] 1 , where the reclassification threshold is 
equal to the average height of the histogram buckets multiplied by (1+S), where S is a positive 
percentage represented as a decimal. 

5. (cancelled) The method of claim 3, where (a), (b), and (c) are repeated until no value 
exceeds the reclassification threshold. 

6. (cancelled) The method of claim 1, where creating the high-bias and histogram buckets 
includes: 

(a) determining an average height of the histogram buckets; 

(b) based on the average height of the histogram buckets, determining a reclassification 

threshold; and 

(c) for each value that exceeds the reclassification threshold: 

(1) if all of the high-bias buckets are not full, representing the value in a high-bias 
bucket; 

(2) else, if the number of high-bias buckets is less than a fixed number of high-bias 
buckets: 

(i) creating a new high-bias bucket; and 

(ii) representing the value in the new high-bias bucket. 
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7. (cancelled) The method of claim 6, where the reclassification threshold is equal to the 
average height of the histogram buckets multiplied by (1+S), where S is a positive percentage 
represented as a decimal. 

8. (cancelled) The method of claim 6, where (a), (b), and (c) are repeated until: 

(i) no value exceeds the reclassification threshold; or 

(ii) a number of the high-bias buckets is equal to the fixed number of high-bias buckets 



9. (cancelled) The method of claim 1, where a total number of buckets is equal to the sum 
of a number of histogram buckets and a number of high-bias buckets, where the total number of 
buckets is fixed, and where the method further includes: 

(a) identifying one or more values that appear in at least the minimum percentage of rows 

and representing the identified values in the high-bias buckets; 

(b) determining a remaining number of buckets equal to the total number of buckets 

minus the number of high-bias buckets used; and 

(c) if the number of remaining buckets is greater than a stop number of buckets: 

(1) adjusting the minimum percentage of rows; 

(2) identifying values that appear in the adjusted minimum percentage of rows; and 

(3) representing values that appear in the adjusted minimum percentage of row in 
high-bias buckets. 

10. (currently amended) The method of claim [[9]] 1, where (a) includes further 




including setting the minimum percentage of rows to FB , where F is equal to a number of 



and each of the high-bias buckets is full. 




tch high - bias bucket 



contain and B is equal to the total number of 



buckets. 
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11. (currently amended) The method of claim [[9]] 1, where (c)(1) includes further 
including adjusting the minimum percentage of rows setting the adjusted minimum 

V(FB - 1) 

percentage to — i '-%, where F is equal to a number of high - bias values that each 

FB 

high - bias bucket can contain, B is equal to the total number of buckets, V is equal to the 
minimum percentage of rows, and I is equal to a number of values represented in high-bias 
buckets. 

12. (cancelled) The method of claim 9, further including: 

(d) if the number of remaining buckets is less than or equal to the stop number of 

buckets: 

representing values not represented in high-bias buckets in histogram buckets. 

13. (cancelled) The method of claim 12, further including: 

(e) repeating (b), (c), and (d) until the number of remaining buckets is less than or equal 

to the stop number of buckets. 

14. (cancelled) The method of claim 1, where a total number of buckets is equal to the sum of 
a number of the histogram buckets and a number of the high-bias buckets, where the total 
number of buckets is fixed, where the number of high-bias buckets is fixed, and where the 
method includes: 

populating the one or more high-bias buckets with the FH most frequently occurring 
values, where F is a number of values each high-bias bucket can store and H is the 
number of high-bias buckets; and 

populating the one or more histogram buckets with all other values. 
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15. (currently amended) A database system including: 
a massively parallel processing system including: 
one or more nodes; 

a plurality of CPUs, each of the one or more nodes providing access to one or more 
CPUs; 

a plurality of data storage facilities each of the one or more CPUs providing access to 

one or more data storage facilities; 
P partitions, each partition residing on one or more data storage facilities; 
a process for representing statistics, where the database system represents statistics about 
a table including one or more rows, each row including a respective value, the 
process including: 

creating zero one or more histogram buckets, each histogram bucket including a 
width representing a respective range of values and a height representing a 
count of rows in the table having values in the range of values; and 

creating one or more high-bias buckets, each high-bias bucket representing 
including one or more high-bias values up to a maximum number of high- 
bias values (F) that appear in a minimum percentage of rows in the table and 
for each high-bias value a number of rows that contain the high-bias 
value ; 

repeating the following: 

(a) determining an average height of the histogram buckets; 

(b) determining a reclassification threshold based on the average height 

of the histogram buckets; and 

(c) concluding that a value associated with one of the one or more 

histogram buckets occurs in more rows of the table than the 
reclassification threshold, and, in response, concluding that the 
number of high-bias values associated with at least one of the one 
or more high-bias buckets has not reached the maximum number 
of high-bias values, and, in response, including the value in one of 
the high-bias buckets for which the number of high-bias values 
has not reached the maximum number of high-bias values; 
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until no values included in any of the ranges of values associated with the 
histogram buckets occur in more than the reclassification threshold 
number of rows in the table; and 

performing query optimization based, at least in part, on one or more of the zero 
saving in a memory the width and the height of each of the one or more 
histogram buckets and the one or more high-bias values and numbers of 
rows for each of the one or more high -bias buckets. 

16. (currently amended) The database system of claim 15, where a total number of 
buckets is a fixed number equal to the sum of the number of histogram buckets and the number 
of high-bias buckets is less than a predetermined number . 

17. (cancelled) The database system of claim 15, where the process creating the high-bias 
and histogram buckets includes: 

(a) determining an average height of the histogram buckets; 

(b) based on the average height of the histogram buckets, determining a reclassification 

threshold; and 

(c) representing each value that exceeds the reclassification threshold in a high-bias 

bucket. 

18. (currently amended) The database system of claim [[17]] 15, where the reclassification 
threshold is equal to the average height of the histogram buckets multiplied by (1+S), where S is 
a positive percentage represented as a decimal. 

19. (cancelled) The database system of claim 17, where the process creating high-bias and 
histogram buckets includes repeating (a), (b), and (c) until no value exceeds the reclassification 
threshold. 
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20. (cancelled) The database system of claim 15, where the process creating high-bias and 
histogram buckets includes: 

(a) determining an average height of the histogram buckets; 

(b) based on the average height of the histogram buckets, determining a reclassification 

threshold; and 

(c) for each value that exceeds the reclassification threshold: 

(1) if all of the high-bias buckets are not full, representing the value in a high-bias 
bucket; 

(2) else, if the number of high-bias buckets is less than a fixed number of high-bias 
buckets: 

(i) creating a new high-bias bucket; and 

(ii) representing the value in the new high-bias bucket. 

21. (cancelled) The database system of claim 20, where the reclassification threshold is equal 
to the average height of the histogram buckets multiplied by (1+S), where S is a positive 
percentage represented as a decimal. 

22. (cancelled) The database system of claim 20, where the process creating high-bias and 
histogram buckets repeats (a), (b), and (c) until: 

(i) no value exceeds the reclassification threshold; or 

(ii) a number of the high-bias buckets is equal to the fixed number of high-bias buckets 

and each of the high-bias buckets is full. 

23. (cancelled) The database system of claim 15, where a total number of buckets is equal to 
the sum of a number of histogram buckets and a number of high-bias buckets, where the total 
number of buckets is fixed, and where the process creating the high-bias and histogram buckets 
further includes: 

(a) identifying one or more values that appear in at least the minimum percentage of rows 

and representing the identified values in the high-bias buckets; 

(b) determining a remaining number of buckets equal to the total number of buckets 

minus the number of high-bias buckets used; and 

(c) if the number of remaining buckets is greater than a stop number of buckets: 
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(1) adjusting the minimum percentage of rows; 

(2) identifying values that appear in the adjusted minimum percentage of rows; and 

(3) representing values that appear in the adjusted minimum percentage of row in 



high-bias buckets. 



24. (currently amended) The database system of claim [[23]] 15, where (a) include? 



further including setting the minimum percentage of rows to FB , where F is equal to a 
number of high - bias values that each high - bias bucket can contain and B is equal to the total 
number of buckets. 

25. (currently amended) The database system of claim [[23]] 15, where (c)(1) includes 
further including adjusting the minimum percentage of rows setting the adjusted minimum 

percentage to — D.%, where F is equal to a number of high - bias values that each 

FB 

high - bias bucket can contain, B is equal to the total number of buckets, V is equal to the 
minimum percentage of rows, and I is equal to a number of values represented in high-bias 
buckets. 

26. (cancelled) The database system of claim 23, where the process creating the high-bias and 
histogram buckets further includes: 

(d) if the number of remaining buckets is less than or equal to the stop number of 

buckets: 

representing values not represented in high-bias buckets in histogram buckets. 

27. (cancelled) The database system of claim 26, where the process creating the high -bias and 
histogram buckets further includes: 

(e) repeating (b), (c), and (d) until the number of remaining buckets is less than or equal 

to the stop number of buckets. 

28. (cancelled) The database system of claim 15, where a total number of buckets is equal to 
the sum of a number of the histogram buckets and a number of the high-bias buckets, where the 
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total number of buckets is fixed, where the number of high-bias buckets is fixed, and where , 
where the process creating the high-bias and histogram buckets further includes: 

populating the one or more high-bias buckets with the FH most frequently occurring 
values, where F is a number of values each high-bias bucket can store and H is the 
number of high-bias buckets; and 
populating the one or more histogram buckets with all other values. 

29. (currently amended) A computer program, stored on a tangible storage medium, for use 
in representing statistics in a database running in a partitioned parallel environment including P 
partitions, each partition residing on one or more parallel processing systems, the database 
including a first table including one or more rows stored in one or more of the P partitions, the 
program including executable instructions that cause a computer to: 

represent statistics about a table including one or more rows, each row including one or 
more values, the program further causing the computer to: 
create zero one or more histogram buckets, each histogram bucket including a width 
representing a respective range of values and a height representing a count of 
rows in the table having values in the range of values; and 
one or more high-bias buckets, each high-bias bucket representing including one or 
more high-bias values up to a maximum number of high-bias values (F) that 
appear in a minimum percentage of rows in the table and for each high-bias 
value a number of rows that contain the high-bias value ; 
repeat the following: 

(a) determine an average height of the histogram buckets; 

(b) determine a reclassification threshold based on the average height of 

the histogram buckets; and 

(c) conclude that a value associated with one of the one or more 

histogram buckets occurs in more rows of the table than the 
reclassification threshold, and, in response, conclude that the 
number of high-bias values associated with at least one of the one 
or more high-bias buckets has not reached the maximum number 
of high-bias values, and, in response, include the value in one of 
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the high-bias buckets for which the number of high-bias values 
has not reached the maximum number of high-bias values; 
until no values included in any of the ranges of values associated with the 
histogram buckets occur in more than the reclassification threshold 
number of rows in the table; and 
perform query optimization based, at least in part, on one or more of the zero 
save in a memory the width and the height of each of the one or more 
histogram buckets and the one or more high-bias values and numbers of 
rows for each of the one or more high -bias buckets. 

30. (currently amended) The computer program of claim 29, where a total number of 
buckets is a fixed number equal to the sum of the number of histogram buckets and the number 
of high-bias buckets is less than a predetermined number . 

31. (cancelled) The computer program of claim 29, including executable instructions that 
cause the computer to: 

(a) determine an average height of the histogram buckets; 

(b) based on the average height of the histogram buckets, determine a reclassification 

threshold; and 

(c) represent each value that exceeds the reclassification threshold in a high-bias bucket. 

32. (currently amended) The computer program of claim [[31]] 29, where the reclassification 
threshold is equal to the average height of the histogram buckets multiplied by (1+S), where S is 
a positive percentage represented as a decimal. 

33. (cancelled) The computer program of claim 31, including executable instructions that 
cause the computer to repeat (a), (b), and (c) until no value exceeds the reclassification threshold. 
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34. (cancelled) The computer program of claim 29, including executable instructions that 
cause the computer to: 

(a) determine an average height of the histogram buckets; 

(b) based on the average height of the histogram buckets, determine a reclassification 

threshold; and 

(c) for each value that exceeds the reclassification threshold: 

(1) if all of the high-bias buckets are not full, represent the value in a high-bias 
bucket; 

(2) else, if the number of high-bias buckets is less than a fixed number of high-bias 
buckets: 

(i) create a new high-bias bucket; and 

(ii) represent the value in the new high-bias bucket. 

35. (cancelled) The computer program of claim 34, where the reclassification threshold is 
equal to the average height of the histogram buckets multiplied by (1+S), where S is a positive 
percentage represented as a decimal. 

36. (cancelled) The computer program of claim 34, including executable instructions that 
cause the computer to repeat (a), (b), and (c) until: 

(i) no value exceeds the reclassification threshold; or 

(ii) a number of the high-bias buckets is equal to the fixed number of high-bias buckets 

and each of the high-bias buckets is full. 

37. (cancelled) The computer program of claim 29, where a total number of buckets is equal 
to the sum of a number of histogram buckets and a number of high-bias buckets, where the total 
number of buckets is fixed, and where the computer program includes executable instructions 
that cause the computer to: 

(a) identify one or more values that appear in at least the minimum percentage of rows 

and representing the identified values in the high-bias buckets; 

(b) determine a remaining number of buckets equal to the total number of buckets minus 

the number of high-bias buckets used; and 

(c) if the number of remaining buckets is greater than a stop number of buckets: 
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(1) adjust the minimum percentage of rows; 

(2) identify values that appear in the adjusted minimum percentage of rows; and 

(3) represent values that appear in the adjusted minimum percentage of row in high- 
bias buckets. 

38. (currently amended) The computer program of claim [[37]] 29, where (a) includes 

further including executable instructions that cause the computer to set the minimum percentage 

-U 

of rows to FB , where F is equal to a number of high - bias values that each high - bias 
bucket can contain and B is equal to the total number of buckets. 

39. (currently amended) The computer program of claim [[37]] 29, where (c)(1) includes 
further including executable instructions that cause the computer to adjust the minimum 

percentage of rows set the adjusted minimum percentage to V(FB — /) ^ w } iere F is equal 
~ FB 

to a number of high - bias values that each high - bias bucket can contain, B is equal to the 

total number of buckets, V is equal to the minimum percentage of rows, and I is equal to a 

number of values represented in high-bias buckets. 

40. (cancelled) The computer program of claim 37, further including executable instructions 
that cause the computer to: 

(d) if the number of remaining buckets is less than or equal to the stop number of 

buckets: 

represent values not represented in high-bias buckets in histogram buckets. 

41. (cancelled) The computer program of claim 40, further including executable instructions 
that cause the computer to: 

(e) repeat (b), (c), and (d) until the number of remaining buckets is less than or equal to 

the stop number of buckets. 

42. (cancelled) The computer program of claim 29, where a total number of buckets is equal 
to the sum of a number of the histogram buckets and a number of the high-bias buckets, where 
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the total number of buckets is fixed, where the number of high-bias buckets is fixed, and where 
the computer program includes executable instructions that cause the computer to: 

populate the one or more high-bias buckets with the FH most frequently occurring values, 
where F is a number of values each high-bias bucket can store and H is the 
number of high-bias buckets; and 
populate the one or more histogram buckets with all other values. 
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